Goto

Collaborating Authors

 profane word


YZR-net : Self-supervised Hidden representations Invariant to Transformations for profanity detection

Joshi, Vedant Sandeep, Tatinati, Sivanagaraja, Wang, Yubo

arXiv.org Artificial Intelligence

In the past few years due to the Covid19 pandemic the adoption of e-learning platforms has increased significantly. The widespread restrictions have forced students to continue their education via online means which causes them to spend a significant amount of their time watching videos and attending classes. This sudden change from offline to online learning has affected a lot of students therefore making an attempt to build systems that can accurately simulate the experience of offline learning can help in smoothing out this drastic transition. Live classes is one such way that gives the students a chance to escape the monotony of watching recorded videos on a daily basis. The interaction aspect of such classes allow the students to clarify small scale doubts instantaneously and at the same time gives teachers the opportunity to compliment the students on good behaviour. All these tiny bits significantly affect the learning outcome for a student by making the course content more interesting and thus improving their overall engagement on the platform. In order to mimic this offline style of interaction there can be a multitude of implementations like live polls or quizzes to check whether the student is paying attention, dynamic interactive diagrams that fuel the curiosity of students by giving them a chance to tinker with it, in-session feedback to understand the student's opinions or the in-class chats mechanism between the participants of a given session. Unlike all the other mechanisms, chats are the most open medium of communication and provide the maximum opportunity to interact with each other.


Apply profanity masking in Amazon Translate

#artificialintelligence

Amazon Translate is a neural machine translation service that delivers fast, high-quality, affordable, and customizable language translation. This post shows how you can mask profane words and phrases with a grawlix string ("?$#@$"). Amazon Translate typically chooses clean words for your translation output. But in some situations, you want to prevent words that are commonly considered as profane terms from appearing in the translated output. For example, when you're translating video captions or subtitle content, or enabling in-game chat, and you want the translated content to be age appropriate and clear of any profanity, Amazon Translate allows you to mask the profane words and phrases using the profanity masking setting.


Ceasing hate withMoH: Hate Speech Detection in Hindi-English Code-Switched Language

Sharma, Arushi, Kabra, Anubha, Jain, Minni

arXiv.org Artificial Intelligence

Social media has become a bedrock for people to voice their opinions worldwide. Due to the greater sense of freedom with the anonymity feature, it is possible to disregard social etiquette online and attack others without facing severe consequences, inevitably propagating hate speech. The current measures to sift the online content and offset the hatred spread do not go far enough. One factor contributing to this is the prevalence of regional languages in social media and the paucity of language flexible hate speech detectors. The proposed work focuses on analyzing hate speech in Hindi-English code-switched language. Our method explores transformation techniques to capture precise text representation. To contain the structure of data and yet use it with existing algorithms, we developed MoH or Map Only Hindi, which means "Love" in Hindi. MoH pipeline consists of language identification, Roman to Devanagari Hindi transliteration using a knowledge base of Roman Hindi words. Finally, it employs the fine-tuned Multilingual Bert and MuRIL language models. We conducted several quantitative experiment studies on three datasets and evaluated performance using Precision, Recall, and F1 metrics. The first experiment studies MoH mapped text's performance with classical machine learning models and shows an average increase of 13% in F1 scores. The second compares the proposed work's scores with those of the baseline models and offers a rise in performance by 6%. Finally, the third reaches the proposed MoH technique with various data simulations using the existing transliteration library. Here, MoH outperforms the rest by 15%. Our results demonstrate a significant improvement in the state-of-the-art scores on all three datasets.


The Science of Swear Words (Warning: NSFW AF)

WIRED

Editor's note: The following excerpt from a book about swear words contains many, many swear words. Some of them are pretty ugly, but it's all in the name of linguistics. Many words describing sexual organs, excretory functions, and so on fail to rise to the heights (or, if you prefer, sink to the depths) of profanity. These words are articulated without fear of offending, whether in the classroom or the courtroom or the examination room. They aren't profane, despite referring to taboo concepts.